home
***
CD-ROM
|
disk
|
FTP
|
other
***
search
/
ftp.cs.arizona.edu
/
ftp.cs.arizona.edu.tar
/
ftp.cs.arizona.edu
/
icon
/
newsgrp
/
group98b.txt
/
000017_icon-group-sender _Wed May 6 16:37:16 1998.msg
< prev
next >
Wrap
Internet Message Format
|
2000-09-20
|
3KB
Return-Path: <icon-group-sender>
Received: from kingfisher.CS.Arizona.EDU (kingfisher.CS.Arizona.EDU [192.12.69.239])
by baskerville.CS.Arizona.EDU (8.8.8/8.8.7) with SMTP id QAA17698
for <icon-group-addresses@baskerville.CS.Arizona.EDU>; Wed, 6 May 1998 16:37:16 -0700 (MST)
Received: by kingfisher.CS.Arizona.EDU (5.65v4.0/1.1.8.2/08Nov94-0446PM)
id AA14346; Wed, 6 May 1998 16:37:12 -0700
Message-Id: <s550a4ba.077@housmtp.oceaneering.com>
X-Mailer: Novell GroupWise 4.1
Date: Wed, 06 May 1998 17:55:54 -0500
From: Charles Hethcoat <CHETHCOA@oss.oceaneering.com>
To: icon-group@optima.CS.Arizona.EDU
Subject: AI use for Icon -Reply
Mime-Version: 1.0
Content-Type: text/plain
Content-Disposition: inline
Errors-To: icon-group-errors@optima.CS.Arizona.EDU
Status: RO
Content-Length: 1963
Terry Murray <tmurray@primary.net> wrote:
>...message would come form a supervisor to the human resources
department asking them to promote Mr. Any Body to a New Title on a
certain date. By having all the employees names prior to running the
program I weight each name based on its reletive position to the other
name within the organization to get a fuzzy match.
If you get a fuzzy name match algorithm together, it sure would be a nice
contribution to the Icon library...I don't know much about the methods
used in these types of searches, but I'll give it some thought also.
>I'm now working on the 'when' to search for dates. I need a different
method than giving the program every possible date in every possible
way.
My first impulse says to try to find candidates for the elements first: year,
month, day of week, and day written in every possible way. Then dope
out month from day separately, based partly on proximity, and then put
the date in a canonical form such as yyyy-mm-dd.
December could be written (in English) as Dec or December (possibly
misspelled), or as 12. Years could be 1998 or 98 or '98. Days could be
1..N where N is a well-known function of month (28..31).
By "proximity" I mean, for example, that a numeric month should be
followed very shortly afterward by viable candidates for year and day,
or it's probably not a date.
A date like 7/9 is ambiguous internationally: July 9 in the U.S. but
September 7 in Europe. This can usually be resolved by knowing the
origin of the writer.
Delimiters seem only to be slashes, dashes or spaces.
Seems like a job tailormade for Icon string scanning and tables.
>Does someone have a program that would pick a date out of a
sentence?
Sorry, I don't. It would make a another great library contribution.
Charles Hethcoat
Senior Engineer, Analysis Department
Oceaneering Space Systems, Inc.
16665 Space Center Blvd., Houston, Texas 77058 USA
chethcoa@oss.oceaneering.com